CLAIM AMENDMENTS 



Claim Amendment Summary 
Claims pending 

• Before this Amendment: Claims 1-15 and 17-44, 

• After this Amendment: Claims 1-15 and 1 7-44 
Non-Elected, Canceled, or Withdrawn claims: None 
Amended claims: 1 - 1 4, 1 7-25, 34, 35, and 40 

New claim: 45 

Claims: 

1. (Currently Amended) A processor-readable medium comprising 
processor-executable instructions for personalizing karaoke, the processor-executable 
instructions comprising instructions for performing a method, the method comprising : 

segmenting visual content to produce a plurality of sub-shots , wherein the 
instructions for segmenting visual content segment video : 

segmenting music to produce a plurality of music sub-clips, wherein the 
segmenting establishes boundaries between the music sub-clips at beat positions within 
the music; 

selecting: important sub-shots from within the plurality of sub-shots: 

selecting sub-shots such that they are uniformly distributed within the video: and 
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displaying at least some of the plurality of sub-shots as a background to lyrics 
associated with the plurality of music sub-clips. 



2. (Currently Amended) The processor-readable medium as recited in claim 
1, additionally comprising instructions for: . wherein the method further comprises 
shortening some of the plurality of sub-shots to a length of a corresponding music sub- 
clip from within the plurality of music sub-clips. 

3. (Currently Amended) The processor-readable medium as recited in claim 
1, wherein segmenting the visual content comprises instructions for : 

dividing a shot into two sub-shots at a maximum peak of a frame difference curve; 

and 

repeating the dividing to result in sub-shots shorter than a maximum sub-shot 

length. 

4. (Currently Amended) The processor-readable medium as recited in claim 
1, additionally comprising instructions for wherein the method further comprises : 

filtering the plurality of sub-shots according to importance; and 
filtering the plurality of sub-shots according to quality. 
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5. (Currently Amended) The processor-readable medium as recited in claim 
4, wherein filtering the plurality of sub-shots according to quality comprises instructions 



examining color entropy within each of the plurality of sub-shots for indications of 
diffusion of color; and 

if color entropy is low, analyzing each of the plurality of sub-shots to detect 
motion more than a threshold indicating interest and less than a threshold indicating low 
camera and/or object movement; 

selecting sub-shots having acceptable motion and/or color entropy scores. 

6. (Currently Amended) The processor-readable medium as recited in claim 
4, wherein filtering the plurality of sub-shots according to importance comprises 
instructions for ; 

evaluating frames within a sub-shot according to attention indices; and 
averaging the attention indices for the frames to determine if the sub-shot should 
be included or excluded. 

7. (Currently Amended) The processor-readable medium as recited in claim 
4, wherein filtering the sub-shots according to importance comprises instructions for : 

analyzing for camera motion, for object motion and for specific objects within the 
sub-shots; 

filtering the sub-shots according to the analysis. 

Serial No.: 10/723,049 _ 

Atty Docket No.: M51-1744US - - lee@hayeS The Business of [P™ 

Atty/Agent: Bea Koempel-Triomas wvAvinohawseoin soiom-ess 



for: 




8. (Currently Amended) The processor-readable medium as recited in claim 
1 , wherein each sub-shot comprises a segment of video of at least a predetermined length 
based on a length of the music sub-clips and segmented based on a magnitude of 
difference between adjacent frames the instructions for segmenting visual - content 
segment video . 

9. (Currently Amended) The processor-readable medium as recited in claim 
[[8]] I, wherein the sub-shots being displayed are selected important and uniformly 
distributed sub-shots and displaying the sub-shots preserves a storyline as represented by 
the visual conten t additionally ' comprising instructions for: 

selecting important - sub - shots - from - wi ^ th^ ^ lHheiDlur - alj ■ t ■ > , of sub shots; and 
selecting sub shots such that they are uniformly di s tributed within the video . 

10. (Currently Amended) The processor-readable medium as recited in claim 
[[9]] I, wherein selecting important sub-shots comprises in s t - ruetion s- foi " 

evaluating color entropy, camera motion, object motion and object detection; and 
selecting the important sub-shots based on the evaluation. 

11. (Currently Amended) The processor-readable medium as recited in claim 
[[9]] I, wherein selecting uniformly distributed sub-shots comprises inst - r - uction s- for : 
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evaluating normalized entropy of the sub-shots along a time line of video from 
which the sub-shots were obtained. 



12. (Currently Amended) The processor-readable medium as recited in claim 
1, wherein the instructions for segmenting visual content includes instructions for 
comprises assigning photographs to be sub-shots. 

13. (Currently Amended) The processor-readable medium as recited in claim 
12, wherein the- instructions for assigning photographs includes instructions for to be sub- 
shots comprises : 

rejecting photographs having problems with quality; and 

rejecting photographs within a group of very similar photographs wherein a photo 
within the group has been selected. 

14. (Currently Amended) The processor-readable medium as recited in claim 
12, wherein the instructions for assigning photographs includes instructions for to be sub- 
shots comprises : 

converting at least one of the photographs to video. 

15. (Original) The processor-readable medium as recited in claim 1, wherein 
the visual content comprises home video and photographs in digital formats. 
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16. (Canceled) 



17. (Currently Amended) The processor-readable medium as recited in claim 
1 , wherein segmenting music into the plurality of music sub-clips comprises instructions 
fer- bounding music sub-clip length according to: 

minimum length = min{max{2* tempo,2},4} and 
maximum length = minimum + 2. 

18. (Currently Amended) The processor-readable medium as recited in claim 
1 , wherein segmenting the music comprises instructions for : 

establishing music sub-clips" length within a range of 3 to 5 seconds. 

19. (Currently Amended) The processor-readable medium as recited in claim 
18, wherein segmenting the music comprises instructions for : 

establishing boundaries for the music sub-clips at sentence breaks. 

20. (Currently Amended) The processor-readable medium as recited in claim 
1. additionally comprising instructions for wherein the method further comprises : 

obtaining the lyrics from a file; and 

coordinating delivery of the lyrics witii die music using timing information 
contained within the file. 
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21. (Currently Amended) A processor-readable medium as recited in claim 
20, wherein obtaining the lyrics comprises instructions for sending the file over a network 
to a karaoke device as a part of a pay-for-play service. 



22. (Currently Amended) The processor-readable medium as recited in claim 
1 , additionally comprising instructions fo r wherein the method further comprises : 

querying a database of songs by humming a portion of a desired song; and 
selecting the desired song from among a number of possibilities suggested by an 
interface to the database. 

23. (Currently Amended) A processor-readable medium comprising 
processor-executable instructions for integrating lyrics, music and video content suitable 
for karaoke, the processor-executable instructions comprising instructions fo r performing 
a method, the method comprising : 

receiving a request for a file associated with a specified song, wherein the file[[;]] 
comprises[[,]]: 

music, lyrics, and timing values associated with the lyrics; 
fulfilling the request for the file by sending the file associated with the specified 

song; 

segmenting the music to produce a plurality of music sub-clips, wherein the 
segmenting establishes boundaries between the music sub-clips at beat positions within 
the music; 
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segmenting visual content to produce a plurality of sub-shots of a length 
corresponding music sub-clips from the plurality of music sub-clips , such that the 
plurality of sub-shots are uniformly distributed within the visual content ; and 

outputting the plurality of music sub-clips together with corresponding sub-shots 
of visual content, wherein the visual content is configured as a background to the lyrics 
associated with the music sub-clips. 

24. (Currently Amended) A processor-readable medium as recited in claim 
23, wherein obtaining the lyrics comprises instructions for sending the file over a network 
to a karaoke device. 

25. (Currently Amended) A personalized karaoke device, comprising: 

a music analyzer configured to segment music to produce a plurality of music sub- 
clips, wherein the segmenting establishes boundaries between the music sub-clips at beat 
positions within the music of a song; 

a visual content analyzer configured to define and select visual content sub-shots, 
wherein the visual content analyzer is configured to select sub-shots of greater 
importance consistent with creating a uniform distribution of the sub-shots over a runtime 
of a source video ; 

a lyric formatter configured to time delivery of syllables of lyrics of the song; and 
a composer configured to; 
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assemble the music-sub clips with the visual content sub-shots;— and 
configured to 

adjust length of the sub-shots to correspond to the music sub-clips : and ^and 
configured to 

superimpose the syllables of the lyrics of the song over the sub-shots, 

26. (Original) The personalized karaoke device of claim 25, wherein the music 
analyzer is configured to segment the song with a strong onset between each of the music 
sub-clips. 

27. (Original) The personalized karaoke device of claim 25, wherein the music 
analyzer is configured to segment the song with a beat between each of the music sub- 
clips. 

28. (Original) The personalized karaoke device of claim 25, wherein the music 
analyzer is configured to segment the song automatically into sub-clips, each having a 
duration that is a function of song tempo. 

29. (Original) The personalized karaoke device of claim 25, wherein the visual 
content analyzer is configured to segment video into sub-shots. 
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30. (Original) The personalized karaoke device of claim 25, wherein the visual 
content analyzer is configured to access folders of home video and photographs 
containing content from which the sub-shots are derived. 

31. (Original) The personalized karaoke device of claim 25, wherein the visual 
content analyzer is configured to assemble still photographs, each of which is a sub-shot. 

32. (Original) The personalized karaoke device of claim 25, wherein the visual 
content analyzer is configured to select from among sub-shots according to ranked 
importance, wherein importance is gauged by detection of color entropy, detection of 
object motion within the sub-shot, detection of camera motion during the sub-shot, and/or 
detection of a face within the sub-shot. 

33. (Original) The personalized karaoke device of claim 25, wherein the visual 
content analyzer is configured to filter out sub-shots having low image quality as 
measured by low entropy and low motion intensity. 

34. (Currently Amended) The personalized karaoke device of claim 25, 
wherein the visual content analyzer is configured to define select sub-shots from visual 
content comprising photographic and video content of greater - importance consistent with 
creating a uniform distribution of the sub shots over a runtime of a source video . 
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35. (Currently Amended) The personalized karaoke device of claim [[25]] 34, 
wherein the visual content analyzer is configured to reject photographs of low quality by 
detecting over and under exposure, overly homogeneous images and blurred images. 

36. (Original) The personalized karaoke device of claim 25, wherein the visual 
content analyzer is configured to organize photographs by date of exposure and by scene, 
thereby obtaining photographs having a relationship. 

37. (Previously Presented) The personalized karaoke device of claim 36, 
wherein the visual content analyzer is configured to reject photographs which are 
members within a group of very similar photographs, wherein one of the group has 
already been selected. 

38. (Original) The personalized karaoke device of claim 25, wherein the visual 
content analyzer is configured to: 

detect an attention area within a photograph; and 

create a photo to video sub-shot based on the attention area, wherein the video 
includes panning and/or zooming. 

39. (Original) The personalized karaoke device of claim 25, wherein the lyric 
formatter is configured to consume a file detailing timing of each syllable and each 
sentence of the lyrics. 
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40. (Currently Amended) An apparatus[[,]] comprising: 

means for creating music sub-clips by segmenting the music to define boundaries 

between the music sub-clips at beat positions within a song; 

means for defining and selecting visual content sub-shots , such that the sub-shots 

are uniformly distributed within the visual content ; 

means for timing delivery of syllables of lyrics of the song; and 

means for assembling the music sub-clips with the visual content sub-shots, 
and to adjust length of the sub-shots to correspond to length of the music sub- 
clips, and to superimpose the syllables of the lyrics of the song over the sub-shots, 

41. (Original) The apparatus of claim 40, wherein the means for defining and 
selecting visual content sub-shots is a video analyzer configured to segment video into 
sub-shots. 

42. (Original) The apparatus of claim 40, wherein the means for defining and 
selecting visual content sub-shots is a video analyzer configured to access folders of 
home video and photographs containing content from which the sub-shots are derived. 

43. (Original) The apparatus of claim 40, wherein the means for defining and 
selecting visual content sub-shots is a video analyzer configured for: 

detecting an attention area within a photograph; and 
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creating a photo to video sub-shot based on the attention area, wherein the video 
includes panning and zooming. 

44. (Original) The apparatus of claim 40, wherein the means for timing 
delivery of syllables of lyrics of the song is a lyric formatter configured for consuming a 
file detailing timing of each syllable and each sentence of the lyrics and for rendering the 
lyrics syllable by syllable. 

45. (New) The apparatus of claim 40 further comprising: 

means for displaying assembled visual content comprising sub-shots with music 
sub-clips; and 
wherein: 

the means for defining and selecting visual content sub-shots, such that the 
sub-shots are uniformly distributed within the visual content is further configured 
for selecting uniformly distributed sub-shots via evaluating normalized entropy of 
the sub-shots along a time line of visual content from which the sub-shots were 
obtained; and 

the means for displaying the assembled visual content comprising sub-shots 
with music sub-clips is configured such that displaying the assembled visual 
content preserves a storyline as represented by the visual content. 
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